Lexical set

A lexical set is a group of words that share a similar feature.

1 Wells Standard Lexical Sets for English
2 Use in other languages
3 See also
4 References
5 Bibliography
6 External links

Wells Standard Lexical Sets for English

The Standard Lexical Sets for English introduced by John C. Wells in Accents of English^[1] are in wide usage. Wells defined each lexical set on the basis of the pronunciation of words in two reference accents, which he calls RP and GenAm.

RP refers to Received Pronunciation, the traditionally prestigious accent in England.^[2]
GenAm refers to a conservative accent of the General American type which has resisted the horse-hoarse and cot–caught mergers.^[3]

Wells classifies words of the English language into 24 lexical sets on the basis of the pronunciation of the vowel of their stressed syllable in the two reference accents. Each lexical set is named after a representative keyword, as follows:^[4]

Key word	RP	GenAm	Example words
KIT	i ɪ	i ɪ	ship, rip, dim, spirit
DRESS	e e	e ɛ	step, ebb, hem, terror
TRAP	a æ	a æ	bad, cab, ham, arrow
LOT	o ɒ	a ɑ	stop, rob, swan
STRUT	u ʌ	u ʌ	cub, rub, hum
FOOT	u ʊ	u ʊ	full, look, could
BATH	a ɑː	a æ	staff, clasp, dance
CLOTH	o ɒ	o ɔ	cough, long, laurel, origin
NURSE	e ɜː	e ɜr	hurt, term, work
FLEECE	i iː	i i	seed, key, seize
FACE	e eɪ	e eɪ	weight, rein, steak
PALM	a ɑː	a ɑ*	calm, bra, father
THOUGHT	o ɔː	o ɔ*	taut, hawk, broad
GOAT	o əʊ	o o	soap, soul, home
GOOSE	u uː	u u	who, group, few
PRICE	a aɪ	a aɪ	ripe, tribe, aisle, choir
CHOICE	o ɔɪ	o ɔɪ	boy, void, coin
MOUTH	a aʊ	a aʊ	pouch, noun, crowd, flower
NEAR	i ɪə	i ɪr	beer, pier, fierce, serious
SQUARE	e ɛə	e ɛr	care, air, wear, Mary
START	a ɑː	a ɑr	far, sharp, farm, safari
NORTH	o ɔː	o ɔr	war, storm, for, aural
FORCE	o ɔː	o or	floor, coarse, ore, oral
CURE	u ʊə	u ʊr	poor, tour, fury

* not followed by /r/

For example, the word rod is pronounced /rɒd/ in RP and /rɑd/ in GenAm. It therefore belongs in the LOT lexical set. Weary is pronounced /ˈwɪərɪ/ in RP and /ˈwɪri/ in GenAm, and thus belongs in the NEAR lexical set.

Some words of the English language do not belong to any lexical set. For example, the stressed syllable of tomato is pronounced /ɑː/ in RP, and /eɪ/ in GenAm, a combination which is very unusual, and is not covered by any of the 24 lexical sets above.^[5]

Choice of Keywords

Wells explains his choice of keywords ("kit", "fleece", etc.) as follows:

[t]he keywords have been chosen in such a way that clarity is maximized: whatever accent of English they are spoken in, they can hardly be mistaken for other words. Although fleece is not the commonest of words, it cannot be mistaken for a word with some other vowel; whereas beat, say, if we had chosen it instead, would have been subject to the drawback that one man's pronunciation of beat may sound like another's pronunciation of bait or bit.^[6]

Wherever possible the keywords end in a voiceless alveolar or dental consonant.^[6]

Unstressed vowels

Wells also describes three sets of words based on their word-final unstressed vowels. Although not included in the standard 24 lexical sets, these "have indexical and diagnostic value in distinguishing accents".^[7]

Keyword	Examples
happY	(silly, Tony, merry)
lettER	(beggar, martyr, visor)
commA	(China, sofa)

Uses

The Standard Lexical Sets of Wells are widely used to discuss the phonological and phonetic systems of different accents of English in a clear and concise manner. Although based solely on RP and GenAm, the Standard Lexical Sets have proven useful in describing many other accents of English. This is true because, in many dialects, the words in all or most of the sets are pronounced with similar or identical stressed vowels. Wells himself uses the Lexical Sets most prominently to give "tables of lexical incidence" for all the various accents he discusses in his work. For example, here is the table of lexical incidence he gives for Newfoundland English:^[8]

KIT	ɪ	FLEECE	iː	NEAR	ɛr
DRESS	ɛ	FACE	ɛː, ɛɪ	SQUARE	ɛr
TRAP	æ	PALM	æ, ɑː	START	ær
LOT	ɒ	THOUGHT	ɑː	NORTH	ɔ̈r
STRUT	ɔ̈	GOAT	ʌʊ	FORCE	ɔ̈r
FOOT	ʊ	GOOSE	uː	CURE	ɔ̈r
BATH	æ	PRICE	əɪ	happY	[i]
CLOTH	ɑː	CHOICE	əɪ	lettER	ər [ɚ]
NURSE	ɜr	MOUTH	əu	commA	ə

The table indicates that, for example, Newfoundland English uses the /ɪ/ phoneme for words in the KIT lexical set, and that the NORTH, FORCE and CURE sets are all pronounced with the same vowel /ɔ̈r/. Note that some lexical sets, such as FACE, are given with more than one pronunciation: this indicates that not all words in the FACE lexical set are pronounced similarly (in this case because Newfoundland English has not fully undergone the pane–pain merger).^[9]

Wells also uses the Standard Lexical Sets to refer to "the vowel sound used for the standard lexical set in question in the accent under discussion":^[10] Thus, for example, in describing the Newfoundland accent, Wells writes that "KIT and DRESS are reportedly often merged as [ɪ]",^[11] meaning that the stressed syllables of words in the KIT lexical set and words in the DRESS lexical set are reportedly often pronounced identically with the vowel [ɪ].

Lexical sets may also be used to describe splits and mergers. For example, RP, along with most non-rhotic accents, pronounces words such as "father" and "farther" identically. This can be described more economically as the merger of the PALM and START lexical sets. Most North American accents make "father" rhyme with "bother". This can be described as the merger of the PALM and LOT lexical sets.

Origin

In a 2010 blog post, Wells wrote:

I sometimes think that a century from now my lexical sets will be the one thing I shall be remembered for. Yet I dreamt them up over a weekend, frustrated with the incoherent mess of symbols used in such contemporary publications as Weinreich's "Is a structural dialectology possible?".^[12]

He also wrote that he claimed no copyright in the Standard Lexical Sets, and that everyone was "free to make whatever use of them they wish".^[12]

Extensions for English

Some varieties of English make distinctions in stressed vowels that are not captured by the 24 lexical sets. For example, some Irish and Scottish accents that have not undergone the fern–fir–fur merger split the NURSE lexical set into multiple subsets. For such accents, the 24 Wells lexical sets may be inadequate. Because of this, a work devoted to Irish English may split the Wells NURSE set into two subsets, a new, smaller NURSE set and a TERM set.^[13]

Use in other languages

Lexical sets have also been used to describe the pronunciation of other languages, such as French.^[14]

References

^ Wells (1982)
^ Wells (1982), pp. 117ff
^ Wells (1982), p. 118
^ Wells (1982), pp. 123ff
^ Wells (1982), p. 122
^ ^a ^b Wells (1982), p 123
^ Wells (1982), p. 165
^ Wells (1982 b), p. 499
^ Wells (1982 b), p. 498
^ Wells (1982), p. 124
^ Wells (1982 b), p. 500
^ ^a ^b "John Wells's phonetic blog: lexical sets". 2010-02-02. http://phonetic-blog.blogspot.com/2010/02/lexical-sets.html. Retrieved 2010-04-30.
^ Hickey, Raymond. A sound atlas of Irish English. Walter de Gruyter, 2004. pp. 54–55. ISBN 311018298X. http://books.google.com/books?id=TZK9cSYRrjMC.
^ Armstrong, Nigel (2001). Social and stylistic variation in spoken French: a comparative approach. Amsterdam: John Benjamins. pp. 100ff. ISBN 9027218390. http://books.google.com/books?id=hu5eYY_4nxcC.

Bibliography

Wells, John C. (1982). Accents of English I: An Introduction. Cambridge, New York: Cambridge University Press. ISBN 0-521-29719-2. http://books.google.com/books?id=UJQwf05yzqYC.
Wells, John C. (1982 b). Accents of English III: Beyond the British Isles. Cambridge, New York: Cambridge University Press. ISBN 0521285410. http://books.google.com/books?id=484eVQ7t8TMC.